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ABSTRACT 

The primary sequence of human DNA polymerase a 
deduced from the full-length cDNA contains regions of 
striking similarity to sequences in replicative DNA 
polymerases from Escherichia coli phages PRD1 and T4, 
Bacillus phage 019, yeast DNA polymerase I, yeast 
linear plasmid pGKLl, maize SI mitochondrial DNA, 
herpes family viruses, vaccinia virus, and adenovirus. 
The conservation of these homologous regions across 
this vast phylogenetic expanse indicates that these 
prokaryotic and eukaryotic DNA polymerases may all 
have evolved from a common primordial gene. Based 
on the sequence analysis and genetic results from yeast 
and herpes simplex virus studies, these consensus 
sequences are suggested to define potential sites' that 
subserve essential roles in the DNA polymerase reac- 
tion. Two of these conserved regions appear to partici- 
pate directly in the active site required for substrate 
deoxynucleotide interaction. One region toward the 
carboxyl-terminus has the potential to be the DNA in- 
teracting domain, whereas a potential DNA primase 
interaction domain is predicted toward the amino- 
terminus. The provisional assignment of these domains 
can be used to identify unique or dissimilar features of 
functionally homologous catalytic sites in viral DNA 
polymerases of pathogenetic significance and thereby 
serve to guide more rational antiviral drug design. 
— Wang, T. S.-E; Wong, S. W.; Korn, D Human 
DNA polymerase a: predicted functional domains and 
relationships with viral DNA polymerases. FASEBJ. 3: 
14-21; 1989. 
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Eukaryotic genomic DNA replication is a tightly 
regulated process that involves the orderly and coordi- 
nated interplay of many discrete protein-protein and 
protein-DNA interactions (1-3). A prerequisite for un- 
derstanding this complex process is the identification 



and characterization of the key protein participants, 
among which an essential element is the replicative DNA 
polymerase. DNA polymerase Of is generally accepted 
to be the principal replicative DNA polymerase in eu- 
karyotes (4). Evidence supporting this claim include the 
following: 1) the expression of DNA polymerase a both 
at the transcriptional and posttranscriptional levels (5) 
is regulated, and correlates positively, with the activation 
of quiescent cells to proliferate; 2) a variety of chemical 
inhibitors that block DNA synthesis in vivo also inhibit 
DNA polymerase <x activity (4, 6); 3) a temperature- 
sensitive mutant for DNA synthesis has been identified 
as a DNA polymerase a mutant (7); 4) monoclonal anti- 
bodies that specifically neutralize DNA polymerase a 
inhibit DNA synthesis in permeabilized cells or when 
microinjected into nuclei (8-10); 5) polymerase a is re- 
quired for in vitro SV40 virus DNA replication, which 
is a model system of eukaryotic cell DNA replication 
(11-15). 

Through nearly 3 decades of intense biochemical 
studies in many laboratories, the extremely low abun- 
dance of this polymerase, its apparent structural com- 
plexity, and its extreme lability during conventional bio- 
chemical purification have distinguished this enzyme as 
one of the more problematic in the biochemical litera- 
ture (4). Several years ago, we established a panel of 
murine hybridomas that produce monoclonal antibodies 
of absolute specificity for DNA polymerase a (8). Two 
immunoaffinity protocols were subsequently developed 
that enabled, the purification of presumptively native 
forms of polymerase a from human KB cells and the 
unambiguous identification of the catalytic polypeptide 
(16, 17). By using the reverse genetic approach, a full- 
length cDNA clone of the human DNA polymerase a 
catalytic polypeptide was isolated and characterized 
(18). 

In this review we will summarize and compare the 
primary protein sequence of human DNA polymerase 
o: with those of several prokaryotic and eukaryotic 
DNA polymerases of interest. Based on the primary 
structure comparisons, several regions of similarity are 
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^ lgUtC l" x p onservcd ™S lons of human DNA polymerase oe and other selected DNA polymerases. Amino acid residues 609-1015 from 
human DNA polymerase a were aligned with amino acid residues derived from other DNA polymerases; identical residues of five or more 
sequences between human DNA polymerase a and other DNA polymerases are boxed. Gaps are indicated by dashes, and extensive gaps 
are marked by the number of amino acids contained within the gap. The designated conserved regions are marked by dashed lines under 
the ammo acid residues. Amino acids 998-1005 of human DNA polymerase a arc defined as region I; amino acids 839-878 are region 
U; amnio acids 943-984 are region III; amino acids 609-650 are defined as region IV; regions V and VI are amino acids 1075-1081 and 
^y-y^o, respectively. Amino acid residues that arc uniquely similar among DNA polymerases and use terminal protein priming mecha- 
nisms are outlined by stippling (shaded areas); amino acid residues uniquely similar among polymerases that use or are predicted to use 
UNA pnmase mechanisms are boxed in dashed lines. Portions of this figure are reprinted, with permission, from ref 18 



detected and are predicted to represent functional cata- 
lytic domains of the human enzyme. Provisional as- 
signment of these regions allows an initial assessment of 
similarities and differences with homologous regions in 
the replicative polymerases of several classes of viruses 
of human pathogenetic importance. 

PRIMARY PROTEIN SEQUENCE SIMILARITY 
TO BOTH PROKARYOTIC AND EUKARYOTIC 
REPLICATIVE DNA POLYMERASES 
Comparison of the deduced primary sequence of human 
DNA polymerase a with known sequences of other 
DNA polymerases reveals several regions of striking 



similarities (18). As shown in Fig. 1, from amino acid 
number 609 to 1085 of human DNA polymerase a, 
there are three regions of similarity to the DNA poly- 
merases of Escherichia coli phage PRDl and Bacillus phage 
029 and to two potential polymerases (predicted by the 
polymerase a-like consensus sequences in the ORF) 
encoded in yeast linear plasmid pGKLI and maize 
mitochondrial Si DNA (19-23); five regions of similarity 
with E, coli phage T4 polymerase, and adenovirus poly- 
merase (24, 25); six regions of similarity with the DNA 
polymerases of herpes simplex virus (HSV), cytomega- 
lovirus (HCV), Epstein-Barr virus (EBV), and vaccinia 
virus (26-29); arid extensive similarity with yeast DNA 
polymerase I (18, 30). These regions are designated I to 
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VI according to their extent of similarity, region I being 
the most similar and region VI the least (18). The sig- 
nificance of these regions is underscored by their loca- 
tions on each respective polypeptide (Fig. 2). All six 
regions exhibit similar linear spatial arrangements on 
each polymerase polypeptide in the order of IV-II-VI- 
III-I-V. However, the distances between each of the 
regions are different. 

Human DNA polymerase a and yeast DNA poly- 
merase I are the largest DNA polymerases of this class, 
each with a molecular mass of about 165 kDa, whereas 
the £ coli phage PRD1 polymerase, which has the 
smallest molecular mass of 63 kDa, contains only 
regions I, II, and III (16-19). Regions I, II, and III are 
present in all of these enzymes and appear to comprise 
a set of basic core sequences for this DNA polymerase 
class. Region V is present in all of the eukaryotic poly- 
merases except for the two potential DNA polymerases 
encrypted in yeast plasmid pGKLl and maize Si mito- 
chondrial DNA. Adenovirus DNA polymerase and the 
two potential DNA polymerases lack region VI; coli- 
phage T4 DNA polymerase is the only prokaryotic 
polymerase that has both regions IV and VI (Fig. 2). 
Sequence comparison of human DNA polymerase a 
with other eukaryotic DNA polymerases, such as DNA 
polymerase /?, terminal transferase, or reverse transcrip- 



tase (31, 32), and other prokaryotic DNA polymerases, 
such as E. coli DNA polymerases I and III, reveals no 
significant similarity (33, 34). 

Southern hybridization of two restriction fragments 
of human DNA polymerase a cDNA that contain five 
of the six consensus sequences with genomic DNA 
samples from the calf, rodent, amphibian, insect, and 
plant demonstrates the presence of these consensus 
sequences in all of the examined vertebrates and inver- 
tebrates, but not in plants (35). 

PREDICTED FUNCTIONAL DOMAINS 

In the process of DNA replication, DNA polymerase a 
is involved with other replication proteins in an ordered 
sequence of binding reactions with DNA template, 
primer, and dNTPs (1-3). There are essentially two 
different modes by which DNA synthesis is initiated. E. 
coli phage PRDl, Bacillus phage £29, and adenovirus 
replicate their DNA by a protein priming mechanism 
(1, 2, 20), whereas all other eukaryotic organisms that 
have been studied, including such DNA viruses as 
HSV, HCV, EBV, and vaccinia virus, are thought to 
initiate DNA synthesis not by protein priming but by 
oligoribonucleotide priming mechanisms. In at least 
some instances, the requisite oligonucleotide primers 
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are synthesized by DNA primase. The common finding 
of a tighdy associated DNA primase in DNA polymer- 
ase Of fractions highly purified from a variety of eukary- 
otic organisms suggests that a potential DNA primase 
interaction domain is present on the polymerase a 
molecule. The polymerase a polypeptide must also 
contain DNA interaction domain or domains for 
template-primer recognition and one or more dNTP 
interaction domains. By comparative analysis of the 
primary protein sequence of human DNA polymerase 
of with those of other DNA polymerases, the regions 
responsible for these primase, DNA, and dNTP inter- 
actions can be provisionally identified. 

Predicted DNA primase interaction domain 

The gene of yeast DNA polymerase I was recently 
sequenced (30). A temperature-sensitive mutant of 
yeast polymerase I has been obtained by in vitro muta- 
genesis (36), which, at nonpermissive temperature, can 
complete one round of cell division and DNA replica- 
tion before it arrests. Biochemical characterization in- 
dicates that the mutation leads to a conformational 
change at elevated temperature that affects the stability 



of the DNA primase-DNA polymerase I complex (30, 
36). Sequence comparison of human DNA polymerase 
a and yeast polymerase I demonstrates an overall 31% 
sequence similarity between these two proteins (Fig. 3). 
Analysis of the temperature-sensitive mutant of yeast 
polymerase I identified a point mutation that confers 
the temperature-sensitive phenorype and maps to con- 
sensus region IV as a single amino acid substitution of 
glycine at codon 493 to positively charged arginine. 
This region of yeast polymerase I exhibits near-perfect 
homology with human DNA polymerase a (Fig. 3). 
Although such a single amino acid change might cause 
an indirect charge effect on polypeptide folding, it is 
attractive to consider that the mutation defines a 
domain that is responsible for DNA primase/DNA 
polymerase interaction. 

Comparison of the primary sequences of this class of 
DNA polymerases also reveals the presence of several 
sequences that are unique to those enzymes that repli- 
cate DNA by a protein priming mechanism (Fig. 1). It 
is interesting that these unique sequences are also 
present m the two potential polymerases, i.e., those 
from yeast pGKLl and maize SI mitochondria. The 
implication of these sequences in the priming protein 
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Figure 4. Amino acid sequence of possible DNA interaction region of human DNA polymerase a and yeast DNA polymerase I. Human 
DNA polymerase a amino acids 1244 to 1391 and yeast pol I amino acids 1246 to 1389 are depicted. Identical amino acids are boxed 
in solid lines, and cysteines are boxed in dashed lines. A schematic potential DNA-binding loop is illustrated with most probable DNA- 
binding side chains marked as black circles. The cysteine residues potentially involved in metal ion coordination are depicted in red. 
Human DNA polymerase or sequence is derived from ref 18 and yeast DNA polymerase I sequence from ref 30, with permission. 



interaction remains to be verified by future mutagene- 
sis studies. 

DNA interaction domain 

Sequence comparison of human DNA polymerase a 
and yeast DNA polymerase I reveals a cysteine-rich 
region toward the carboxyl-terminus in both DNA 
polymerases (Fig. 4). From amino acid number 1244 to 
1391 of human DNA polymerase a, there is a region 
potentially able to form a DNA binding motif (Zn- 
finger) with an extended protein loop containing amino 
acids with side chains having the potential to interact 
with the phosphate backbone of DNA (37-39). Figure 4 
presents a schematic depiction of this putative DNA in- 
teraction domain. 

Deoxynucleotide interaction domains 

The presence of the three consensus sequences (regions 
I, II, and III) in the DNA polymerases of bacterio- 
phages, DNA viruses, unicellular fungi, invertebrates, 
and vertebrates suggests that these sequences subserve 
essential functions in polymerase catalysis (18, 35). Se- 
quence comparison of human DNA polymerase a and 
the HSV DNA polymerase (HSV pol) reveals homology 
in all six conserved regions (18), with 87.5% in region 
I; 60% in region II; 47% in region III; 26% in region 
IV; 57% in region V; and 10.5% in region VI (Fig. 1 
and Fig. 5). By marker rescue and marker transfer ex- 



periments, many HSV mutants have been identified 
that contain mutations in the DNA polymerase gene; 
some of these mutations confer altered sensitivity to 
either aphidicolin (Aph) or antiviral drugs such as acy- 
clovir (ACV), bromovinyldeoxyuridine (BVdU), gan- 
ciclovir (DHPG), vidarabine (araA), and phosphono- 
acetic acid (PAA) (28, 29, 40-42). Mutations that con- 
fer altered sensitivity to these drugs (all of which are 
either analogs of dNTPs or pyrophosphate, or competi- 
tive inhibitors, such as aphidicolin) are potential 
markers of the dNTP and PPj binding site or sites. 

Because any single mutation might affect substrate 
or antiviral drug recognition simply by an indirect 
charge effect, it is essential to examine a large number 
of mutants. Sequence analysis of many of these mutants 
from three strains of virus has revealed that a majority 
of the mutations are single amino acid substitutions 
within consensus regions II and III (28, 40, 42) (Fig. 
5). Thus, it is reasonable to predict that these regions 
are sites that are direcdy involved in dNTP binding or 
pyrophosphate hydrolysis (40). However, two muta- 
tions have recendy been identified that do not reside 
within regions II or III. One, in region V, has a single 
amino acid change of asparagine 96 i to lysine, which 
confers altered sensitivity to aph and DHPG; the cor- 
responding residue at this site in human polymerase a 
is aspartate (18, 40). Another mutation, PAA r C 7 localized 
between regions VI and III, confers resistance to phos- 
phonoacetic acid and has a single amino acid substitu- 
tion of proline7g 7 to threonine; interestingly, human 



18 Vol. 3 Jan. 1989 



The FASEB Journal 



WANG ET AL. 



Human DNA pol a 609 
Herpes Simplex I 437 



Human DMA pol a 836 
Kerpes Simple* I 694 



H. E I r n I^R G F PI L A K ft"* 1 °PI D 1 1 Vf^HfNljYGfFjE L 

[vJl £ fos|ejf e«{l|l a[fjm t l(vjk q v g[pJ£ fv t{6Jy[n_i]i m1f|o w 

~"" ~ ~ — — — * I V~ - --- - — 



E V 
P F 



40 



L L 
L L 



lAFpjGlG L 

1q|gJ1k 



10 



20 



V L 

V L 



0 P 
D P 



30 



K V 


G F 


YDKFILLL 


0 F 


N 


S L Y P 


T S 


G F 


HVNPVVVF 


D F 


i 


S L Y P 



Q R 

A EC 



40 

£ FfNllfcl 

a hInIlIcI 



Human DMA pol a f9 T ffl v " ^ A S e|a]q UT E D G E - - Q E Q I P E L P D P S - L IB M 

Herpes Simplex I lfls(T(LSLIR|AOAv£|HLEAGKDYLE I E VGGRRLFFVKAHV r|e|- 



Human DNA pol a 
Herpes Simplex I 



90 

i[l]prei[r]klv 

L[L|S I L L [rJ 0 W L 
VI-- 



VER 
A H 



R K Q 
ft K Q 



100 

V K Q L N K| 
I R S R I P 



110 

D L I L Q Y 
E E A V L L 




Human DMA pol a 
Herpes Simplex I 



ilic|vl3cl 



130 



S M 



y g 

Y G 



c lJgIf 



140 

SYSRFYAKPl 
QHGLLPCLHV 
Ill 



A A 
A A 



V T 

V Tl 



150 



160 

Y Kp!lE]lfL|M HfTlK 



Human DNA pol a 
Herpes Simplex I 



(eJhIvIq kwnlev 

iil V (vj H 28 1 



170 

V 



G 0 
G D 



180 
M I N 
F V L 



MSTNL 
R G L T A 



1015 
899 



Human DNA pol a 1068 [Yj V T Ik) Q E L BTgIl fol I (TrIr 0 M fcl 1085 
Herpes Simplex I 94C IyJ G G [&j M L 1 U V L (oJ Kp.iN E 963 

v 

Figure 5. Conserved sequences between human DNA polymerase a and herpes simplex virus I DNA polymerase, with predicted dNTP 
interaction domains. The six conserved regions are aligned, and identical amino acid residues are boxed. Amino acids that are identical 
to herpes simplex virus mutants with single amino acid substitutions are boxed with solid lines and shaded. Herpes virus polymerase muta- 
tions at positions nonidentical to DNA polymerase a are shown by designating the substituted amino acid in a shaded box with dashed 
lines and the corresponding polymerase a residue with a large black dot. Reprinted from ref 18 and derived from refs 27 and 40 with 
permission. * 



DNA polymerase a has an identical proline residue at 
this position (40). Figure 6 summarizes these predicted 
functional domains. 

NONCONSERVED REGIONS OF VIRAL 
REPLICATIVE ENZYMES THAT ARE 
INVOLVED IN DRUG RECOGNITION 

Several mutants of herpes simplex polymerase with 
altered drug sensitivity have amino acid substitutions 
at loci that are dissimilar in sequence to human DNA 
polymerase or. Recently, Gibbs et al. (40) characterized 
two mutants derived from wild-type HSV type I, strain 
KOS, with altered drug sensitivity phenotypes that 
were mapped to the HSV pol gene in a unique region 
A, from amino acid number 565 to 637 of HSV pol. 
This region, as defined by two mutations (tsD9, glu- 
tamic 597 to lysine; PFA T 2 y alanine 605 to valine) has 
sequence similarity only to other viral DNA polymer- 
ases that exhibit sensitivity to certain antiviral drugs. 



Remarkably, this region is not present in the human 
cell DNA polymerase a sequence. In addition to these 
changes in region A, four other mutations of HSV pol 
within regions II and III exhibit substitutions^ amino 
acid residues that are not identical in human polymer- 
ase a. 

The further characterization of these HSV pol 
mutants in reference to the predicted functional domains 
of human DNA polymerase a should comprise a par- 
ticularly attractive approach to the design of specifically 
targeted antiviral drugs. 
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Figure 6. Schematic representation of the putative functional 
domains of human DNA polymerase a. Reprinted and modified 
from ref 18, with permission. 
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FUTURE PERSPECTIVES 

The genetic information content of chromosomal DNA 
must be copied accurately by DNA polymerases for 
faithful transmission from one generation to the next. 
How eukaryotes accomplish error-free DNA replica- 
tion with a principal replicative DNA polymerase, pol 
a, that does not have an intrinsic exonuclease activity 
for redactory function remains an intriguing question. 
In vitro steady-state kinetics studies have demonstrated 
that polymerase a interacts with its substrates by a 
rigidly ordered terreactant sequential mechanism and 
exhibits respectable fidelity and modest processivity in 
the absence of any exogenous or accessory proteins (17, 
43, 44). Full explication of the molecular basis of this 
complex DNA polymerization reaction remains a for- 
midable challenge. The identification of six consensus 
regions in human DNA polymerase a that are present 
in both prokaryotic and eukaryotic replicative DNA 
polymerases argues that these enzymes may comprise a 
family that has arisen from a common ancestral gene 
(18, 35) and that the homologous regions have been 
conserved to accomplish the elementary steps of the 
DNA polymerization reaction. 

The predicted functional domains for DNA interac- 
tion, primase interaction and dNTP interaction should 
soon be testable by assessing the functional expression 
of recombinant DNA polymerases designed by site- 
specific mutagenesis. Careful examination of the kinetic 
parameters of functional mutants of DNA polymerase 
a in comparison with those of the well-characterized 
wild-type polymerase should shed light on how this 
large and complicated protein molecule recognizes and 
interacts with its several substrates to accomplish 
accurate DNA replication. In addition, the identifica- 
tion of domain or domains required for interaction 
with other accessory DNA replication proteins should 
greatly advance our understanding of the details of 
eukaryotic DNA replication in vitro and in vivo. 

Extensive studies of herpes simplex DNA polymer- 
ase mutants with altered sensitivity to antiviral drugs 
have failed to define any specific, single region on the 
HSV polymerase molecule as the sole binding site for 
these drugs or substrates (40). This suggests that dNTP 
interacting site or sites may require interactions among 
several separate regions on the polymerase molecule, a 
hypothesis that can be examined either by isolation of 
suppressor mutants or by expression of appropriately 
constructed recombinant herpes DNA polymerases, 
followed by physical studies. The expression of enzy- 
matically active recombinant adenovirus DNA poly- 
merase from cloned DNA was recently accomplished 
(45, 46). Site-specific mutagenesis in region I was 
attempted but resulted in an inactive enzyme (J. A. 
Engler, personal communication). This result, together 
with the failure to date to isolate mutants in region I of 
the HSV DNA polymerase, suggests that region I may 
be particularly critical for DNA polymerization and 
viral viability 

Mutagenesis studies of expressed recombinant DNA 
polymerases from human cells and viruses, combined 



with examination of physicochemical properties and 
molecular structure, should ultimately permit precise 
anatomic delineation and functional assignment to 
each of these regions and thereby facilitate the develop- 
ment of a comprehensive molecular understanding of 
the DNA polymerization reaction. [5] 

We thank Dr. Donald M. Cocn for a preprint of his identification 
of amino acids in HSV DNA polymerase involved in substrate and 
drug recognition and for helpful comments on this manuscript, and 
Dr. Jeffrey A. Engler for communication of his experimental results 
before publication. The studies from our laboratory were supported 
by National Institutes of Health grant CA14835 and by a gift from 
the Donald E. and Delia B. Baxter Fund. 
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